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TITLE 

The Time of Flight System on a Chip 

STATEMENT OF GOVERNMENTAL INTEREST 
[0001] This invention was made with U.S. Government support under contract nos. NAGW- 
4547 and NAGW5-8516 with NASA. The U.S. Government has certain rights in the 
invention. 

CROSS REFERENCE TO RELATED APPLICATIONS 
[0002] This application is related to and claims the benefit of U.S. Provisional Patent 
Application Serial No. 60/371,506, filed April 10, 2002 entitled "The Time of Flight 
System on a Chip (TOF chip)". 

BACKGROUND 

[0003] There is a broad family of science instruments that benefit from precise time interval 
measurement. These instruments include particle spectrometers, delay line UV 
imagers, laser range finding instruments, etc. In addition, for applications in space, 
size, weight, and reduced power dissipation are very desirable characteristics for 
these instruments. A key part of these systems is their electronics. An attractive 
solution with respect to the electronics is a very large scale integration (VLSI) 
System on a Chip (SoC) implementation. Such an implementation can dramatically 
reduce power dissipation, instrument size, and instrument weight significantly 
increasing the effectiveness of precise time measurement instruments. 
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SUMMARY 

3004] The present invention may be characterized as an integrated analog and digital 
time-of-flight (TOF) chip. TOF refers to the ability to capture and digitize precise 
time intervals. A time interval may be defined as the time between a start and stop 
electrical signal. The start and stop signals can be either analog or digital. The 
measurement of the unknown time interval is performed with reference to a known 
constant time interval equal to the period of an external clock. Thus, the three basic 
inputs to the TOF chip are the start signal, stop signal, and external clock signal. 
The result of the measurement is an 11 bit digital word output and a valid event 
trigger output that indicates a valid result on the 11 bit output bus. There are other 
inputs and outputs on the TOF chip including a separate standard microprocessor 
parallel readout bus, a serial bus, as well as control and test pins. 
;0005] The TOF chip is comprised of two major functional blocks and a variety of lesser 
functional blocks. The first major functional block includes two constant fraction 
discriminators (CFDs) for measuring analog start and stop signals. The second 
major functional block includes a time-to-digital converter (TDC). Other blocks 
include a phase locked loop (PLL) that provides a control function for the TDC to 
lock to the external clock. The function of the PLL is very important because it 
performs power supply, temperature and radiation compensation tasks. Valid event 
digital control logic is a function that rejects anomalous event situations such as 
double starts, double stops, and the like. Another function includes three event 
accumulators for counting the total number of starts, stops, and valid events. There 
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is a logarithmic compression function for the output and an averaging function for 
reducing single shot time jitter. 
)006]The complete CFD-TDC functionality is used when the start and stop signals are 
analog. TDC only functionality is used when the start and stop signals are digital. 
The TDC includes a core of 2048 very large scale integration (VLSI) elements. An 
electric pulse of duration equal to a start-stop time difference T is transmitted 
through the VLSI core. Core elements truncate the pulse by a certain fixed time 
(delta T) equal to a digitization step of the TDC commonly known as the least 
significant bit (LSB). As the pulse propagates through the VLSI core, its duration 
ultimately diminishes to zero and it disappears. The decoding of the position (N) 
within the core when this takes place is translated into a measure of the pulse length 
T where T = N * LSB. The LSB is easily defined by the external clock via the PLL 
and is in the range of-25ps to 2.5 ns. The maximum time interval measured directly 
by the TDC is T ma x = 2048 * LSB. Longer time intervals are measured by counting 
clock periods. The combination of fine measurement via the 2048 VLSI core 
elements and coarse measurement via clock periods provides a very large time 
interval measurement equal to 32 bits or, 2 32 * LSB, that has a resolution as fine as 
LSB. Thus, T max is approximately 0.1 seconds for an LSB of 25ps and 
approximately 10 seconds for an LSB of 2.5ns. 
[0007] The TOF chip of the present invention includes all of the analog and digital signal 
processing necessary for time interval acquisition and digitization in a single 
monolithic chip. It is a plug-and-play analog (and/or digital) input digital output 
device that uses very little power and provides tremendous resolution. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
3008] FIGURE 1 illustrates a block diagram of the time-of-flight (TOF) chip. 
3009] FIGURE 2 illustrates a block diagram of a constant fraction discriminator (CFD). 
3010] FIGURE 3 illustrates a block diagram of a time-to-digital converter (TDC). 
001 1] FIGURE 4 illustrates a transistor level diagram of a DLL cell. 

0012] FIGURE 5 illustrates a block diagram of an energetic particle sensor (EPS) system 

utilizing the TOF chip of the present invention. 
0013] FIGURE 6 illustrates the time of flight interval versus energy for four groups of ion 

mass categories. 

DETAILED DESCRIPTION 

;0014] A high level block diagram of a time of flight (TOF) chip according to the present 
invention is illustrated in FIGURE 1. The TOF chip includes two constant fraction 
discriminator (CFD) channels 102, one time-to-digital conversion (TDC) core 104, 
valid event logic circuitry 106, and event accumulators. The CFDs 102 interface to 
start-stop MCP anodes 110 via off-chip single transistor amplifiers 112. The TDC 
core 104 digitizes the start-stop time difference using 11 -bits, with reference to an 
off-chip precise clock. An integrated phase locked loop 114 (PLL) provides a time 
reference to the external clock and compensates for power supply, temperature, and 
other environmental variations. 

[001 5] the valid event logic 106 rejects invalid events such as double starts, stops not 
associated with a start in a predefined window, single event upset (SEU) caused 
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errors, and others. An adder/subtractor 116 optionally corrects for offsets. Four 
counters measure the total number of start, stop, valid event pulses and total 
number of events (valid or not) that are processed. For each valid event, a TOF 
result is provided directly onto a bus after a certain processing deadtime. The TOF 
result is an 1 1-bit linear word or an 8-bit quasi-log compressed word. 
D016] A block diagram of a CFD 102 channel is illustrated in FIGURE 2. Each CFD 102 
utilizes two comparators 202a, 202b, an analog delay line 204, and a voltage divider 
206 to form the constant fraction operation. The bottom comparator 202a is termed 
the arming comparator or leading edge discriminator and it fires when an input pulse 
crosses a threshold value. The top comparator 202b is termed the zero crossing 
comparator and it fires when the difference of its two inputs switches polarity. The 
delay line 204 is external to the chip. There is also an optional on-chip analog delay 
line with controllable delay in the range of 0.5ns to 2ns. Typical delay values are in 
the range 1 ns to 2 ns for input pulses with peaking times 3 ns to 6 ns. The 
comparator baseline is internally adjusted to 0.5 v dd . The comparators 202a, 202b 
are self-compensated to remove offsets with the help of feedback loops. The gain of 
the loops is quite large so that the offsets are compensated to less than 0.2 mv. The 
bandwidth of the two feedback circuits, which is also controllable off-chip, is selected 
low compared to the bandwidth of the forward chain for overall stability. 
[0017] A positive input pulse is applied to the arming comparator input Arm*. A delayed 
version of the pulse is applied to zero crossing comparator input Zcin-, while an 
attenuated version is applied to input Zcin+. The arming comparator 202a fires and 
sets the flip-flop when the input amplitude crosses a user-defined threshold. The 
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zero crossing comparator 202b fires at a constant fraction of the input pulse at a 
time theoretically independent of the amplitude. An AND combination 208 of the two 
comparators 202a, 202b produces a CFD output that, in terms of timing properties 
such as time walk and time jitter, is defined by the zero crossing comparator 202b. 
Both the arming and CFD outputs are available with both polarities. 
0018] For precise time pick-off, it may be necessary to operate at as high a speed as 
possible so a cascade comparator topology is used. It is possible to optimize the 
gain-bandwidth product by selecting the number of stages and the gain-bandwidth of 
the individual stages such that the overall low frequency gain is equal to: 

A r = Av r Av 2 -... -Av r ...-Av„ (1) 
where Av t is the gain of the I th stage. Each stage is comprised of an operational 
trans-conductance amplifier (OTA) 210. The transfer function of an OTA 210 is 
described by an integrator with a single dominant pole. Assuming equal bandwidth 
for all stages the frequency dependence of the total gain can be expressed as: 

f i Y 

A r (jo>) = (Av t ) n 



JtDT+1) 



where 



and 



1 „ gm/Co 

r 



Av, « gm/go 

in which gm, go, and Co are the trans-conductance, the output conductance, and the 
parasitic output capacitance for each OTA 210. Solving for a>_ 3dB yields: 
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co_ idB = 1-V2- - 1 (2) 

X 

3019] Although the gain increases with the number of stages, overall bandwidth 
decreases. Therefore, an optimum number of stages can be selected to maximize 
the bandwidth for a given total gain. In this example, the gain of each stage is 
selected at approximately 3.5 the nominal CFD current of approximately 1 .8 mA and 
the single stage pole is gm/Co « 2 x 10 9 rod / sec . 

0020] With n = 8 for the zero crossing comparator 202b the overall gain is 87 dB and the 
negative 3dB bandwidth is 30% of the single stage. With n = 5 for the arming 
comparator 202a the overall gain is 54 dB and the negative 3 dB bandwidth is 38% 
of the single stage. 

[0021] The TDC design is generic to cover a broad range of scientific applications utilizing 
different modes of operation. Some general requirements of the TDC 104 include: a 
capability to process both random and triggered events; a low dead time to process 
event rates up to 1 M/s; stability for temperature, power supply, and environmental 
effects such as radiation (TVE); built-in logic to rapidly reject non-valid events such 
as double starts, stops with no starts, etc; good integral non-linearity; and low power 
dissipation. 

[0022] The principle of operation of the TDC chip is described in FIGURES 3 and 4. 
Referring to FIGURE 3, the start-stop time difference is defined by the rising edges 
of two positive going signals entering a time difference generating logic (TDGL) 
block 302. The TDGL logic 302 additionally filters many non-valid events. A valid 
negative going pulse with duration equal to the start-stop time difference then enters 
a core of 2048 TDC cells 304. Each cell truncates the traveling pulse by a certain 
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fixed time amount At, equal to the least significant bit (LSB) of the TDC system. If 
the start-stop time difference is less than T max = 2048 x LSB, the traveling pulse will 
disappear at a certain cell location inside the core. Decoding of that cell location 
yields a digital reading of the event. Decoding is achieved via RS flip-flops 
contained in every cell. If the flip-flop of one cell is set and the flip-flop of the next 
cell is not, the position is decoded using a thermometer logic circuit. If start-stop is 
greater than T ma x, the system simply overflows and clears automatically to accept 
another event. 

0023] The basic element of the core, a TDC cell, is illustrated in FIGURE 4. The TDC cell 
400 is comprised of a current starving inverter 402 followed by a standard inverter 
404. The Vitrei voltage generated by a charge pump circuit 306, controls the rise 
time of a pulse t rX , at the output of the current starving inverter 402. 

;0024]The falling time t fx is defined by a pull down transistor. Thus, the output of the first 
stage is asymmetrical and electronically controlled. The second stage is a standard 
inverter 404 with properly sized pull-down and pull-up for symmetrical operation. The 
cumulative effect of the TDC cell to a negative pulse with duration T is a negative 
output pulse with reduced duration. The reduced duration is controllable and is 
expressed as delta - T. 

[0025] The delay introduced to a falling edge traveling through a TDC cell is: 

t f = t ri +t /2 
The delay introduced to a rising edge is: 

It is possible to design the cell so that approximately: 
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tf \ = * fl = hi 

Thus, the total width of the pulse after a TDC cell is reduced by: 

At = t,-t r z t ri - t r2 

We define the time truncation as: 

delta - T = At 
which is the LSB of the TDC system where: 

LSB = t n -t r2 (3) 
0026] The processing time required to digitize a start-stop time difference T that results in 
a code N is: 

T N =Nt r +T 

where t r has been defined as the delay introduced by a single cell to a traveling 
rising edge. We define the dead time (Tdt ) of the TDC as the time it takes to digitize 
the maximum time (T max ) that results in the maximum code. 

T^^N^+T^ (4) 
[0027] The TDC chip is designed to operate in full-core (11 -bits) where all 2048 cells are 
used, half-core (10-bits) where only 1024 cells are used, and quarter-core (9-bits) 
where only 512 cells are used. In each case, the LSB is the same and is defined by 
the clock. However, the maximum measurable time and the dead time scale 
accordingly. 
Thus: 

T mwn =2048 LSB 

^max 1024 = ^max 2048 ^ ^ 
^max 512 = ^max 2048 ' ^ 
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Tdt 2MS =2048 -t, +r imx 2048 
Tdt l02A = Tdt vm l2, 
Tdt 5l2 =Tdt 20AS I A. 

The smaller the dead time, the higher the event rate could be processed. 
;0028] The LSB as expressed in equation (3) above is a strong function of temperature, 
power supply and radiation induced MOS parameter shifts, unless a special control 
operation is applied to keep it stable. Stability is achieved by implementing a special 
PLL known as a delay locked loop (DLL). Parts of the DLL include a time reference 
generating logic (TRGL) block 308 and the charge pump 306. The TRGL 308 
applies a train of negative going pulses with pulse width T c ik equal to the period of 
the clock. The charge pump 306 adjusts the Vitrei error voltage, so that these 
pulses disappear on average at the end of the DLL delay line. The error voltage, 
Vcontroi Is low pass filtered and applied to all 2048 elements of the main TDC core. 

[0029] The TDC is designed to have two calibration modes of DLL. The first is an open 
loop DLL, with 2048-elements to make the event measurements and a separate 
closed loop DLL having 512 elements locked to a reference clock for the calibration. 
The second calibration mode is a closed loop DLL, having 2048 elements for both 
event measurements and the calibration. Open loop calibration mode is mostly used 
in random event cases such as space particles and UV. Closed loop calibration 
mode is used in triggered event cases such as laser range finding. Closed loop 
calibration mode can also be used for random events when it is acceptable to lose a 
certain number of events during calibration (typically 1% due to slow calibration). 

[0030] In open loop calibration mode, a pulse of width equal to the period of the reference 
clock is applied every 2048 (or 128) clock periods into the 512 element calibration 
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DLL. The sum of the 512 element delay is forced to be equal to the clock period, T C ir. 
The controlled voltage generated by the charge pump is applied to both the 512- 
element calibration DLL and the 2048-element open loop DLL as illustrated in 
FIGURE 3. The LSB in open loop calibration mode becomes: 

and the maximum measurable time becomes: 

rmax TOde _ fl =r 2048 =4-7WAr. 
[0031] In closed loop calibration mode, a pulse of width equal to the period of the reference 
clock is applied into the same 2048 element DLL in a time sharing mode with the 
event measurement. Time sharing is defined by a user supplied calibration trigger. 
Typically a trigger frequency of 1KHz is enough to compensate for environmental 
(TVE) effects. While calibrating, the system rejects incoming start-stop events, 
implying a loss of a small number of events when operating in random mode. The 
sum of the 2048 element delay is forced to be equal to the clock period, Tdk- The 
control voltage generated by the charge pump is then applied to the entire 2048 
element closed loop DLL. The LSB in closed loop calibration mode becomes: 

r W - Tclk (6) 

and the maximum measurable time becomes: 

^ max mod =r 2048 =Tclk . 
[0032] In the 0.8u CMOS technology used to fabricate some TOF chips, the cumulative 
delay of a two-inverter element is approximately 0.6 ns. In a typical case of a 10 
MHz-reference clock the following results are expected for open calibration mode, 
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an lsb ^ e . a « 200/>5 where the maximum measurable times and dead times are 
approximately 400 ns measurable time and approximately 1200 ns +400 ns dead 
time for full core, approximately 200 ns measurable time and approximately 600 ns + 
200 ns d time for the half core, and approximately 100 ns measurable time and 
approximately 300 ns + 100 ns d time for the quarter core. 
[0033] For closed loop calibration mode and a 10 MHz reference clock, the 
LSB,^^ « 50ps where the maximum measurable time is approximately 100 ns and 

dead time is approximately 1200 ns +100 ns. 

[0034] Another aspect of the DLL is matching between cells in the core and the resulting 
integral non-linearity (INL) error. However, if INL distribution is fixed and does not 
vary with TVE, it can be easily removed by post-calibration. 

[0035] An analysis was performed to derive general matching properties and design 
principles. Mismatches among the TDC cells will result in time errors. Process 
parameters such as transistor trans-conductance parameter, B = Ko * W/L threshold 
voltage, V t , and gate or diffusion capacitances, C, are not identical inside the whole 
chip area. Time truncation, or the LSB as defined in equation (3) of the single TDC 
cell of FIGURE 4, can be expressed as: 

LSB= ( , C > c T ]xK w -f — r -£* JxF We (7) 

where C, is the overall internal node capacitance, C a is the overall output 
capacitance, 0 P is the transconductance parameter of the pFETs, V t is the threshold 
voltage of the PFETs, V gs =Vconiroi is the gate source voltage that controls the delay of 
the current starved pFET, V dd is the power supply; Vmi is the gate-source voltage at 
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which the internal nFET pulls-down, and V| He is the gate-source voltage at which the 
external nFET pulls-down. Both V| H j and V| He are proportional to the threshold 
voltage Vm of the nFETs. The parenthetic terms of equation (7) approximate the 
inverse of the rise-time slew rates of the starved and the internal pFETs respectively. 
[0036] Assuming independence between all parameters in above expression, the 
normalized variance <5i 2 = 6 2 (ALSB/LSB) of the LSB can be expressed as: 



where 6 x 2 = 6 2 (AX/X). Each variance operator 5 2 in the above expression is 
inversely proportional to the transistor gate area, S 2 approximately 1/WL. In addition 
matching is improved with a large value of V gs - V t for all operating conditions. It also 
helps to use the maximum power supply possible. For the present discussion, much 
larger than minimum device areas are described and the cell was optimized to 
operate in the 50 ps to 500 ps range with V gs close to Vdd at the high end time 
resolution. 

[0037] It is interesting to investigate how device matching propagates in the DLL chain. The 
variance of the open (non-controlled) delay line after m delay elements is simply the 
sum of the variances of m elements: 



However, the variance of a closed delay line at the Nth element will be different, 

N 

because of the constraint that the sum of the individual delays £ T k is forced to be 



constant. The variance at the m element of a DLL of N total elements is described 
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by: 

<r*.=* % *[ mmiN N ~ m) ] ( 10 > 

The maximum time variance occurs in the middle of the delay line at m=N/2 with a 
value: 

2 N 2 (11) 

4 

Thus, the variance of the maximum time deviation of a closed DLL line is % the 
value of the open loop delay line. 
[0038] In open loop mode of operation, the measurement DLL is open and has 2048 
elements. The variance at the end will be: 

CT 2 2048 -open = 2048 • C7 * . 

[0039] In closed loop mode of operation, the DLL is closed and has 2048 elements. The 
variance at the end will be: 

& 2 2048 -closed =512 • O * 

and the variance in open loop mode is 4-times worse of that in closed loop mode. 
[0040] It is possible to fix the increased variance of open loop mode on a chip by chip basis 
if one measures the maximum processing time of a particular chip, 
rmax^^V-— in the fu " core selection - This measured value will 
generally have an offset because it is an open DLL where: 

^2048 ^measured = ^ " ^clk + ^offset (^) 

However, T 2Q4 ^ measured has a fixed value independent of the TVE variation and a new 
LSB can be defined as: 
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T 

T CD ~ 2043 _ measured 

mod e- a- measured ~ 2048 

[0041] With this measured LSB value the open loop DLL behaves like a closed loop DLL in 
that the total delay is fixed. The INL will again have minimum values at the 
beginning and at the end of the DLL and the maximum variance around the 
expected value based on the measured LSB will be in the middle and still equal to: 

N I4cr 2 \ = 512 -a, 2 . 
This is achieved because of the measurement of the offset error. 

[0042] The last consideration pertaining to the TDC is timing jitter error. In all modes of 
operation timing jitter is a monotonically increasing function of the start-stop time 
difference because of the additive effect of each DLL element. Special care is taken 
to minimize jitter such as crosstalk from clock lines with many guard-rings in the 
layout etc. The time jitter contribution of the TDC is much less than the overall time 
error introduced by the front-end (CFD jitter plus time walk). 

[0043] One common measurement in space is the detection of energetic particles. One 
application specific use for the TOF chip of the present invention is within a compact 
Energetic Particle Sensor (EPS) system. The TOF chip reduces EPS system mass 
to less than 0.5 Kg and power dissipation to less than 0.5 w. These are major 
improvements over the current state of the art. 

[0044] A top-level schematic that describes the principle of operation of such an EPS is 
shown in FIGURE 5. For each radiation event the instrument uses combined time-of- 
flight and energy measurements to discriminate ion species. 

[0045] Energetic particles randomly enter a detector head 502, from an 180° opening 
through a collimator 504. Particles penetrate a thin foil 506 in the front and hit an 
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assembly of (six in this case) Solid State Detectors (SSDs) 508 in the back. 
Secondary electrons produced in the front and back foils are electro-statically 
focused on corresponding start and stop MCP anodes. 

[0046] Processing of the start-stop time difference yields the speed of a particle, and 
processing of the total charge released in an SSD yields particle energy. Particle 
mass is determined by combining the TOF and energy measurements. In addition, a 
particular SSD address provides angular information. 

[0047] The goal of the EPS is to discriminate ions in four common mass categories as 
measured in space (H, He, O, Fe) as well as eight energy bands (10KeV/neucleon 
to 2MeV/neucleon). FIGURE 6 illustrates the time of flight interval versus energy for 
four groups of ion mass categories. In order to discriminate these four mass 
categories a time resolution of approximately 0.5 ns and an energy resolution of 
approximately 5 Kev FWHM are required depending on particular telescope 
geometries. A desirable event-processing rate is on the order of 0.1 to 0.2 M/s. 
Furthermore, it is highly desirable to improve the time and energy resolution in order 
to discriminate for isotopic mass species and push the lowest energy threshold to 
less than 1 Kev. To achieve these targets, the time and energy resolutions should 
be improved to less than 100 ps and less than 0.4 kev FWHM respectively. 
Additionally the event throughput are increased to greater than 1 M/s due to lower 
energy thresholds and the low power nature of the particle distributions. 

[0048] In the following claims, any means-plus-function clauses are intended to cover the 
structures described herein as performing the recited function and not only structural 
equivalents but also equivalent structures. Therefore, it is to be understood that the 
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foregoing is illustrative of the present invention and is not to be construed as limited 
to the specific embodiments disclosed, and that modifications to the disclosed 
embodiments, as well as other embodiments, are intended to be included within the 
scope of the appended claims. The invention is defined by the following claims, with 
equivalents of the claims to be included therein. 
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